Method and apparatus for storing, searching and playing back audiovisual information and data files

ABSTRACT

Method and apparatus for storing, searching and playback of audiovisual items of information and data files, using a protocol for multimedia multiplexing and multimedia control, with a control for the multimedia information streams in a separate virtual control channel according to ITU-T H.245, with multiplexers or, respectively, demultiplexers for information according to ITU-T H.223, with a video compression and coding means or, respectively, video decompression and decoding means, e.g., according to ITU-T H.263, with an audio/speech compression and audio/speech decompression means for compression or, respectively, decompression with at least one high-compression speech compression algorithm, and with a control for the multimedia memory device in a separate logical control channel.

BACKGROUND OF THE INVENTION

The invention relates to a method and apparatus for storing, searching and playing back audiovisual information and data files. In particular, the invention relates to a multi-media memory device.

Storing and playing back items of multimedia information (video, audio, data, control information) on digital storage media (e.g., CD-ROM, optically writable hard disk, magnetic hard disk) is one of the most important functions in multimedia systems.

Although the capacity to store items of information on digital storage media is constantly growing, the large amount of data of multimedia information continues to present a massive storage problem.

A standardized method under ISO/IEC 11172 (also known as MPEG-1) enables storage of approximately one hour of audiovisual information on CD-ROM with data streams of 1.5 Mbits/s. Other, non-standardized audiovisual compression methods commonly used in PC applications also enable approximately one hour of stored audiovisual program, albeit with a worse image and tone quality.

In relation to these known methods characterized in Table 1 below, an arrangement according to the present invention enables considerably higher compression, more effective multiplexing, and more effective controlling of multimedia data streams.

TABLE 1 Examples of playback time for multimedia memory solutions based on ITU H.324 Playback time Playback time Playback time at 32 kbit/s at 128 kbit/s at 512 kbit/s QCIF CIF ITU-R 601 (180 × 144) (360 × 288) (720 × 576) Image Image Image resolution at resolution at resolution at Memory H.263; G.723; H.263; H.263; Volume G4kbit/s G.723; G.728; G.728, G.729; Medium: (MByte): audio G.729 audio G.722 audio Diskette: 1.4 About 6 — — minutes CD-ROM 660 about 46 about 11 about 2.5 hours hours hours Example 10 about 41 about 10 — Data minutes minutes File: Hard — about 240 about 1 about 4 Disk/ kbyte/min. Mbyte/min mbte/min Minute Example 540 about 38 about 9 hours about 2.25 Hard hours hours Disk:

The quality of MPEG-1 Video and MPEG-1 Audio is indeed as a rule better than the other methods listed in Table 1, but there are numerous applications in which a reduced video and audio quality is completely sufficient (e.g. multimedia mail, video images with head and shoulders, multimedia lexicons).

SUMMARY OF THE INVENTION

The present invention provides a method and apparatus for the storing, searching and playback of audiovisual data files.

To that end, in an embodiment, the invention provides a method for storing, searching and playback of highly compressed audiovisual items of information and data files of a multimedia memory device, using a multimedia multiplexing and multimedia control protocol, with the following steps:

a) controlling multimedia streams of information in a first separate virtual control channel according to ITU-T H.245 in order to enable flexible allocation and simultaneous processing of several audio/speech, video and data channels for multimedia communication;

b) multiplexing, or, respectively, demultiplexing video items of information and/or audio/speech items of information and/or data items of information and/or control information according to ITU-T H.223 in order to enable flexible allocation of channel capacities corresponding to the current needs of the channels allocated in the preceding step;

c) compressing and coding, or, respectively, decompressing or decoding video signals according to ITU-T H.263;

d) compressing, or, respectively, decompressing the audio or, respectively, speech signals using a high-compression speech compression algorithm; and

e) controlling the multimedia storage device via a second separate virtual control channel.

In an embodiment, the invention provides an apparatus for storing, searching and playing back of highly compressed audiovisual items of information and data files of a multimedia memory apparatus, using a multimedia multiplexing and multimedia control protocol, comprising: an information stream control for controlling the multimedia information streams in a separate virtual control channel according to ITU-T H.245, thereby enabling a flexible allocation and the simultaneous processing of several audio/speech channels, video channels and data channels for multimedia communication; a multiplexer and demultiplexer for multiplexing or, respectively, demultiplexing of video items of information and/or audio/speech items of information and/or data items of information and/or control information according to ITU-T H.223, thereby enabling a flexible allocation of channel capacities corresponding to the current needs of the channels allocated by the named controlling; a video compression and coding means for the compression and coding of video signals, and with a video decompression and decoding means for the decompressing and decoding of video signals; an audio/speech compression and audio/speech decompression means for the compression or, respectively, decompression of audio signals or, respectively, speech signals with a high-compression speech compression algorithm; and a device control for controlling the multimedia memory device via a further separate logical control channel.

In an embodiment, the invention provides an apparatus in which the second separate virtual control channel is an additionally opened virtual data channel according to ITU-T H.245.

In an embodiment, the invention provides an apparatus in which the audio-speech compression and audio/speech decompression means can be operated at least with a speech compression algorithm according to ITU-T G.723.1, ITU-T G.729, ITU-T G.728, ITU-T G.722, ISO/IEC 11172-3, or according to ITU-T G.4 kbit/s.

In an embodiment, the invention provides an apparatus in which the multimedia storage device is realized by a computer with a magnetic hard disk memory.

In an embodiment, the invention provides an apparatus in which the multimedia memory device is a computer with a read-only optical memory means.

In an embodiment, the invention provides an apparatus in which the multimedia storage device is a computer with a write/read optical memory means.

Depending on the individual embodiments and special features, the invention makes use as needed of the following information technology standards and/or communication technology standards:

The present standardization of speech coders in the ITU-T with very low bit rates for videotelephony (ITU-T G.723) in the public telephone dialing network (GSTN) leads to qualitatively good speech coders (approximating the quality of the recommendation CCITT G.726), with a transmission speed of 5.3 to 6.3 kbit/s. The ITU-T G.729 speech coder also enables digital speech transmission with a speed of 8 kbit/s. In the future, a 4 kbit/s coder will also be standardized (ITU-T G.4 kbit/s). The codecs are currently the most efficient speech codecs.

The present ITU-T standardization of moving image coders with very low bit rates, e.g., for videotelephony in the public telephone dialing network (ITU-T H.263), leads to qualitatively good moving image coders (QCIF resolution 180×144 and lower) with the minimum required transmission speed of 8-24 kbit/s (or higher), which require a secured type of transmission (e.g., with ITU-T H.223). An increase in the image resolution via the values defined in the standard, e.g., to CIF (360×288) or ITU-T 601 (720×576) enables the transmission of moving images with television images or, respectively, moving images according to the resolution of the digital studio standard ITU-R 601.

The present standardization in the ITU-T of multiplexing of audiovisual types of data with very low bit rates, e.g., for videotelephony in the public telephone dialing network with a transmission speed of 9.6-32 kbit/s (and higher), which enables a secured type of transmission (according to ITU-T H.223). This principle can also be used for memory systems.

The present ITU-T standardization (ITU-T H.245) relating to the control of audiovisual types of data with very low bit rates, e.g., for videotelephony in the public telephone dialing network, which enables a flexible allocation of up to 15 independent useful channels, respectively with audio/speech information, video information or data information. This principle can also be used for memory systems. Each channel is provided with a flexible bandwidth, which can vary arbitrarily from application to application in the running of the memory application.

These and other features of the invention are discussed in greater detail below in the following detailed description of the presently preferred embodiments with reference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic diagram of an exemplary embodiment of a multimedia memory system according to the invention.

FIG. 2 illustrates a multimedia multiplexing scheme.

DETAILED DESCRIPTION OF THE PRESENTLY PREFERRED EMBODIMENTS

The various standards referred to herein are incorporated by reference.

A multimedia memory system consists of several functional units. The video I/O-devices (input/output) typically contain e.g. a camera, a display screen and an image preparation unit for the mixing of several images (split-screen). In an advantageous construction of the invention, all the devices can actually be connected. Audio/speech I/O devices include a microphone (or several), one or more loudspeakers and an audio/speech preparation unit (e.g. for echo suppression). Here as well, in an advantageous construction, all the devices can actually be connected.

The system control controls the overall system, i.e., it provides for the multimedia control for the multimedia multiplexer and for the overall multimedia system control. The video codec provides for the digital compression and decompression of the video signal in the video coder or, respectively, video decoder. The audio/speech codec provides for the digital compression/decompression of the audio/speech signal. An optional delay of the speech signal can be carried out in order to achieve lip synchronization between video and speech. Given multimedia storage, the multiplexer/demultiplexer merges the audio signals, video signals and data signals in one common data stream, or, respectively, given retrieval from the multimedia data bank, separates the common multimedia data stream into separate audio, video and data signals.

The system control consists of the controlling of the multimedia multiplexing (according to ITU-T H.245) and of the overall multimedia memory system (e.g. loading/termination of the multimedia memory program in the computer).

In multimedia multiplexing according to ITU-T H.245, up to 15 useful channels can be opened and used. According to H.245, before the storing of the useful channels, the user parameters are settled on and are set. The data memory thereby indicates the broadest possibility of storing of items of multimedia information, and the storing application makes the final decision and selects which multimedia channels are to be opened and how the multimedia storing in the data memory is to take place.

For the storing of an audiovisual message (voice mail), one channel for video, one channel for speech, a data channel for the voice mail control (addressing, time of the storing of the voice mail, etc.) and the obligatory ITU-T H.245 multimedia control channel could for example be opened.

For the storing of an audiovisual German-French language course, one channel for video, one channel each for sound in German and in French, one data channel each for the German and for the French accompanying text, one data channel for the language course program control, and the obligatory ITU-T H.245 multimedia control channel could for example be opened.

For the storing of a sports film sequence (e.g., soccer), one channel could for example, first be allocated to for sound and one channel to for image. Following a goal, for example, five channels could for example, be allocated in the short term for video. A different camera position is assigned to each channel. By this means, during playback the user could dynamically select an arbitrary camera position (e.g., from above, from the goal perspective, from behind, or from the side).

Before the playback of the useful channels, the user parameters are also settled on and are set according to ITU-T H.245. The data memory thereby indicates the broadest possible storing of multimedia items of information. Here as well, the playback application (i.e., the multimedia memory system) makes the final selection of the multimedia channels to be opened, and determines how the multimedia playback from the data memory is to take place.

FIG. 2 illustrates the multimedia multiplexing. The lowest layer PS is the physical layer. This is realized in the computer bus, the interface between the external digital memory (CD-ROM, hard disk, etc.), and the multimedia multiplexing. The multiplexer (similar to ITU-T H.223, with the difference that multimedia data are provided not for an analog telephone network, but rather for the bus of a computer) is provided with two layers: what is known as an adaptation layer AL and a multiplex layer ML. The adaptation layer AL is responsible for the adaptation of the diverse streams of information (which come from the different media sources (video, audio/speech, data)) to the multiplex layer ML.

In FIG. 2, four adaptation layers AL are specified: a data adaptation layer DAL, an audio/speech adaptation layer AuAL, a video adaptation layer VAL and a control adaptation layer CAL for the transmission of multimedia control data. In the multiplex layer ML, each adaptation layer makes use of the services of what is called a convergence sublayer CS and what is called a segmentation/assembly sublayer SARS. The convergence sublayer CS provides for error recognition and for error correction. The segmentation and reassembly sublayer SARS provides for the fragmentation of the data streams into what are known as SAR-SDUs (SDU—service data unit), tailored to the multiplex layer ML.

The video codec (video), which codes or, respectively, decodes the video items of information, is located above the video adaptation layer AL. The audio codec (audio), which codes or, respectively, decodes the audio items of information, is located above the audio adaptation layer AL. The data protocols necessary for the data application are located above the data adaptation layer AL (data). A special data channel is allocated to the ITU-T H.245 multimedia control protocols.

The adaptation layers AL display transmission errors during storage, and error corrections are initiated. The adaptation layer AL further fragments the information streams into smaller units.

The multiplex layer ML provides for the multiplexing of the various types of information that are prepared by the adaptation layers AL. During access/playback, the multiplex layer ML provides for just demultiplexing of the arrived data stream into data fragments of the various types of information that are forwarded to the respectively responsible adaptation layer AL. The adaptation layers AL assemble the individual data streams from the data fragments, which streams are forwarded to the applications (speech/audio, video, data, multimedia control).

Although modifications and changes may be suggested by those skilled in the art, it is the intention of the inventors to embody within the patent warranted hereon all changes and modifications as reasonably and properly come within the scope of their contribution to the art. 

What is claimed is:
 1. A method for storing, search and playback of highly compressed audiovisual items of information and data files of an electronic multimedia memory device, using a multimedia multiplexing and multimedia control, comprising the following steps: a) controlling multimedia streams of information in a first separate virtual control channel according to ITU-T H.245 in order to enable flexible allocation and simultaneous processing of several audio/speech, video and data channels for multimedia communication; b) multiplexing or, respectively, demultiplexing video items of information and/or audio/speech items of information and/or data items of information and/or control information according to ITU-T H.223 in order to enable flexible allocation of channel capacities corresponding to the current needs of the channels allocated in the preceding step; c) compressing and coding, or, respectively, decompressing and decoding video signals; d) compressing or, respectively, decompressing the audio or, respectively, speech signals using a high-compression speech compressing algorithm; and e) controlling the electronic device via a second separate virtual control channel.
 2. An apparatus for storing, searching and playback of highly compressed audiovisual items of information and data files of an electronic multimedia memory apparatus, using a multimedia multiplexing and multimedia control, comprising: an information stream control for controlling the multimedia information streams in a separate virtual control channel according to ITU-T H.245, thereby enabling a flexible allocation and the simultaneous processing of several audio/speech channels, video channels and data channels for multimedia communication; a multiplexer and demultiplexer for multiplexing or, respectively, demultiplexing of video items of information and/or audio/speech items of information and/or data items of information and/or control information according to ITU-T H.223, thereby enabling a flexible allocation of channel capacities corresponding to the current needs of the channels allocated by the named controlling; a video compression and coding means for the compression and coding of video signals, and with a video decompression and decoding means for the decompressing and decoding of video signals; an audio/speech compression and audio/speech decompression means for the compression or, respectively, decompression of audio signals or, respectively, speech signals with a high-compression speech compression algorithm; and a device control for controlling the multimedia memory device via a further separate logical control channel.
 3. An apparatus according to claim 2, characterized in that the second separate virtual control channel is an additionally opened virtual data channel according to ITU-T H.245.
 4. An apparatus according to claim 2, characterized in that the audio/speech compression and audio/speech decompression means can be operated at least with a speech compression algorithm according to a standard selected from the group consisting of ITU-T G.723.1, ITU-T G.729, ITU-T G.728, ITU-T G.722, ISO/IEC 11172-3 and ITU-T G.4 kbit/s.
 5. An apparatus according to claim 2, characterized in that the multimedia storage device is realized by a computer with a magnetic hard disk.
 6. An apparatus according to claim 2, characterized in that the multimedia memory device is a computer with a read-only optical memory means.
 7. An apparatus according to claim 2, characterized in that the multimedia storage device is a computer with a write/read optical memory means.
 8. An apparatus according to claim 3, characterized in that the audio/speech compression and audio/speech decompression means can be operated at least with a speech compression algorithm according to a standard selected from the group consisting of ITU-T G.723.1, ITU-T G.729, ITU-T G.728, ITU-T G.722, ISO/IEC 11172-3 and ITU-T G.4kbit/s.
 9. An apparatus according to claim 3, characterized in that the multimedia storage device is realized by a computer with a magnetic hard disk.
 10. An apparatus according to claim 4, characterized in that the multimedia storage device is realized by a computer with a magnetic hard disk.
 11. An apparatus according to claim 3, characterized in that the multimedia memory device is a computer with a read-only optical memory means.
 12. An apparatus according to claim 4, characterized in that the multimedia memory device is a computer with a read-only optical memory means.
 13. An apparatus according to claim 3, characterized in that the multimedia storage device is a computer with a write/read optical memory means.
 14. An apparatus according to claim 4, characterized in that the multimedia storage device is a computer with a write/read optical memory means. 